Robust blind methods using $\ell_p$ quasi-norms
Abstract: It was shown in previous work that some blind
methods can be made robust to channel order overmodeling by
using the $\ell_1$ norm or $\ell_p$ quasi-norms. However, no
theoretical argument has been provided to support this
observation. In this work, we study the robustness of blind
subspace-based methods that use the $\ell_1$ norm or $\ell_p$
quasi-norms. For the $\ell_1$ norm, we provide the necessary and
sufficient condition that the channel must satisfy to ensure its
identifiability in the noiseless case. We then study how often
this condition holds, and deduce the effect of the channel
parameters on the robustness of blind subspace methods using the
$\ell_1$ norm.



I. Introduction 

Despite being proposed several decades ago, blind methods have
so far failed to be commercially deployed in civil communication
products. This can be attributed to several practical
difficulties, which limit their use to signal processing
applications where training cannot be used. Among the most
difficult challenges is the sensitivity of a number of blind
methods to channel order overmodeling errors [1], [2]. This is,
for example, the case of conventional subspace methods, which are
known to be highly sensitive to channel order overmodeling, even
in the noiseless case. Indeed, in the noiseless case, the channel
is identified as the vector that spans the one-dimensional kernel
of a certain matrix Q, which depends on the second-order
statistics of the received signal. When the channel order is
overestimated, however, the kernel of Q is no longer a line but a
vector space whose dimension depends on the overestimated order.
The question is thus how to choose the right direction among all
the vectors in the kernel of Q.

To deal with this problem, considerable effort has been devoted
either to equipping sensitive methods with an efficient channel
order estimator [3] or to proposing methods that are robust to
channel order overmodeling. In this context, a new blind
technique for sparse channel estimation has recently been
proposed. To select the channel vector, this technique
additionally uses a sparsity criterion, measured by an $\ell_p$
quasi-norm ($0 < p < 1$). Simulations indicated that, in this
way, blind methods such as cross-relation and deterministic
likelihood based techniques become robust to overestimation of
the channel order [4], [5]. For non-sparse channels, introducing
a sparsity criterion is likewise expected to enhance the channel
identifiability probability. The reason is that overmodeling the
channel is equivalent to zero-padding the channel vector, which
thus becomes artificially sparse. Moreover, in the case of blind
subspace methods, it can be shown that selecting the right
direction is equivalent to choosing the sparsest vector.



In this work, we study the robustness of subspace-based methods
using $\ell_p$ quasi-norms for non-sparse channels. We derive a
necessary and sufficient condition for channel identifiability
when the $\ell_1$ norm is used, as well as a sufficient condition
when an $\ell_p$ quasi-norm is used. We then derive a lower bound
on the probability that the necessary and sufficient condition
holds. This lower bound allows us to study the effect of the
system parameters on the channel identifiability probability. For
instance, we show that increasing the number of antennas
significantly improves the channel identifiability probability,
whereas increasing the number of channel coefficients tends to
reduce it.

This paper is organized as follows. Section II presents the
channel model. Section III is devoted to the derivation of the
channel identifiability conditions. Finally, a probabilistic
analysis is carried out in Section IV, before the simulation
results are presented in Section V.

II. System model and Blind subspace methods 

Consider a SIMO communication link in which a receiver equipped
with M antennas receives data from a single-antenna transmitter.
The received vector at time k can be written as:

$$\mathbf{y}_k = \sum_{l=0}^{L} \mathbf{h}_l s_{k-l} + \mathbf{v}_k$$

where $\mathbf{h}_l$ is the channel impulse response vector
corresponding to the l-th tap and $\mathbf{v}_k$ denotes the
additive Gaussian noise vector. Let
$\mathbf{h} = [\mathbf{h}_0^T, \cdots, \mathbf{h}_L^T]^T$ be the
channel vector.

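Stacking n + 1 consecutive observations of $\mathbf{y}_k$ into
the $(n+1)M$ vector
$\mathbf{y}_n = [\mathbf{y}_k^T, \cdots, \mathbf{y}_{k-n}^T]^T$,
we get:

$$\mathbf{y}_n = \mathcal{T}_n(\mathbf{h})\,\mathbf{s}_k + \mathbf{v}_k$$

where $\mathbf{s}_k = [s_k, \cdots, s_{k-n-L}]^T$, $\mathbf{v}_k$
now denotes the correspondingly stacked noise vector, and
$\mathcal{T}_n(\mathbf{h})$ is the $M(n+1) \times (L+n+1)$
block-Toeplitz matrix:

$$\mathcal{T}_n(\mathbf{h}) = \begin{bmatrix}
\mathbf{h}_0 & \cdots & \mathbf{h}_L & & \\
 & \ddots & & \ddots & \\
 & & \mathbf{h}_0 & \cdots & \mathbf{h}_L
\end{bmatrix}$$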
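As a quick sanity check of the stacked model in the noiseless case, the following Python sketch builds the block-Toeplitz matrix above and compares its product with the tap-by-tap convolution. The function names `block_toeplitz` and `received`, the binary symbols, and the sizes M, L, n are illustrative assumptions, not part of the original derivation.

```python
import random

def block_toeplitz(h, n):
    """Block-Toeplitz matrix of the stacked model, as a list of rows.

    h is a list of L+1 taps, each tap a list of M floats (assumed layout).
    """
    L, M = len(h) - 1, len(h[0])
    rows = []
    for b in range(n + 1):                  # block row b carries y_{k-b}
        for m in range(M):                  # antenna index inside the block
            row = [0.0] * (L + n + 1)
            for l in range(L + 1):
                row[b + l] = h[l][m]        # h_l multiplies s_{k-b-l}
            rows.append(row)
    return rows

def received(h, s, k):
    """Noiseless y_k = sum_l h_l s_{k-l}; s is indexed by absolute time."""
    L, M = len(h) - 1, len(h[0])
    return [sum(h[l][m] * s[k - l] for l in range(L + 1)) for m in range(M)]

rng = random.Random(0)
M, L, n = 2, 2, 3                           # hypothetical sizes
h = [[rng.gauss(0, 1) for _ in range(M)] for _ in range(L + 1)]
s = [rng.choice([-1.0, 1.0]) for _ in range(20)]
k = 10

T = block_toeplitz(h, n)
s_k = [s[k - j] for j in range(L + n + 1)]  # [s_k, ..., s_{k-n-L}]
stacked = [v for b in range(n + 1) for v in received(h, s, k - b)]
product = [sum(a * x for a, x in zip(row, s_k)) for row in T]
max_err = max(abs(a - b) for a, b in zip(stacked, product))
print(len(T), len(T[0]), max_err)
```

The matrix has M(n+1) rows and L+n+1 columns, and the two ways of computing the stacked vector should agree up to rounding.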



Blind methods are based on the second-order statistics of the
received signal $\mathbf{y}_n$. Assuming that:

A.1 The transmitted symbols $s_k$ are independent and
identically distributed (i.i.d.) with zero mean and unit variance,

A.2 The noise is white with variance $\sigma^2$,

the covariance matrix of $\mathbf{y}_n$ is given by:

$$\mathbf{R} = E[\mathbf{y}_n \mathbf{y}_n^T] = \mathcal{T}_n(\mathbf{h})\mathcal{T}_n(\mathbf{h})^T + \sigma^2 \mathbf{I}_{(n+1)M}.$$
Under the assumption that

A.3 The subchannels of vector $\mathbf{h}$ are real and have no
zeros in common. Moreover, n > L and M > 1,

the rank of $\mathcal{T}_n(\mathbf{h})$ is equal to L + n + 1.
Matrix $\mathbf{R}$ then has exactly L + n + 1 eigenvalues
associated with the signal subspace, whereas the remaining
eigenvalues, all equal to $\sigma^2$, are associated with the
noise subspace. It thus admits the following eigenvalue
decomposition:

$$\mathbf{R} = \mathbf{U}\boldsymbol{\Lambda}\mathbf{U}^T + \sigma^2 \mathbf{N}\mathbf{N}^T$$

where $\boldsymbol{\Lambda}$ is diagonal with L + n + 1 nonzero
diagonal elements, and $\mathbf{U}$ and $\mathbf{N}$ are matrices
with orthonormal columns which span the signal and noise
subspaces, respectively. Define
$\boldsymbol{\Pi} = \mathbf{N}\mathbf{N}^T$, the noise projector.
In the absence of noise, the noise projector satisfies:

$$\boldsymbol{\Pi}\,\mathcal{T}_n(\mathbf{h}) = \mathbf{0}. \qquad (1)$$



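Let $\boldsymbol{\Pi} = [\boldsymbol{\pi}_0, \cdots, \boldsymbol{\pi}_n]$.
Subspace-based methods consist in searching for the solution of
the following equation:

$$\boldsymbol{\Pi}\,\mathcal{T}_n(\mathbf{f}) = \mathbf{0} \qquad (2)$$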



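If the number of coefficients L in $\mathbf{f}$ is accurately
known, then there exists, up to a scalar factor, only one vector
$\mathbf{f}$ satisfying (2). If $L' > L$, then the solutions of
(2) belong to a vector space of dimension $L' - L + 1$ spanned by
the columns of the following $M(L'+1) \times (L'-L+1)$ Toeplitz
matrix [1, Lemma 2]:

$$\mathbf{H} = \begin{bmatrix}
\mathbf{h}_0 & & \\
\vdots & \ddots & \\
\mathbf{h}_L & & \mathbf{h}_0 \\
 & \ddots & \vdots \\
 & & \mathbf{h}_L
\end{bmatrix}$$

Obviously, the channel vector $\mathbf{h}$ is almost surely the
vector that exhibits the highest sparsity among all vectors in
the range space of matrix $\mathbf{H}$. If the channel order is
unknown, the channel vector can thus be recovered by solving the
following optimization problem:

$$(P_0): \quad \min_{\mathbf{f}} \|\mathbf{f}\|_0 \quad \text{subject to} \quad \mathbf{f} = \mathbf{H}\mathbf{s}, \quad s_1 = 1$$

where the condition $s_1 = 1$ is introduced to exclude the
trivial zero solution ($s_1$ refers to the first entry of vector
$\mathbf{s}$). Solving $(P_0)$ requires an intractable
combinatorial search, which reduces its interest for real-time
applications. A common approach is to replace the $\ell_0$
quasi-norm, which is non-convex and discontinuous, with the
$\ell_p$ quasi-norm, thereby leading to the following
optimization problem:

$$(P_p): \quad \min_{\mathbf{f}} \|\mathbf{f}\|_p^p \quad \text{subject to} \quad \mathbf{f} = \mathbf{H}\mathbf{s}, \quad s_1 = 1$$

III. Channel Identifiability Conditions

It has been shown through simulations in [5] that the use of the
$\ell_1$ norm and $\ell_p$ quasi-norms can enhance the robustness
of blind subspace methods. The aim of this paper is to provide
theoretical arguments that account for this observation. More
explicitly, we provide necessary and/or sufficient conditions
under which the solution of problem $(P_p)$ corresponds to the
zero-padded channel vector in the noiseless case, a situation
which we refer to as channel identifiability.

Before that, we introduce some matrices depending on the channel,
which will later serve to formulate the identifiability
conditions. Since $s_1$ has to be set to 1, the first M entries
of $\mathbf{f} = \mathbf{H}\mathbf{s}$ are equal to
$\mathbf{h}_0$ for every admissible $\mathbf{s}$ and do not
affect the minimization. Problem $(P_p)$ can therefore be written
as:

$$(P_p): \quad \min_{\mathbf{g} \in \mathbb{R}^{L'-L}} \|\mathbf{h}_z + \tilde{\mathbf{H}}\mathbf{g}\|_p^p$$

where $\mathbf{h}_z = [\mathbf{h}_1^T, \cdots, \mathbf{h}_L^T, \mathbf{0}^T]^T$
is a vector of length $ML'$ and $\tilde{\mathbf{H}}$ is the
$ML' \times (L'-L)$ block-Toeplitz matrix having the same shape
as $\mathbf{H}$. Partition $\tilde{\mathbf{H}}$ as

$$\tilde{\mathbf{H}} = \begin{bmatrix} \mathbf{A} \\ \mathbf{B} \end{bmatrix}$$

where $\mathbf{A}$ is formed by the first $ML$ rows of
$\tilde{\mathbf{H}}$, whereas $\mathbf{B}$ is the sub-matrix
composed of the last $M(L'-L)$ rows of $\tilde{\mathbf{H}}$,
i.e., with $\delta = L' - L$, $\mathbf{A}$ and $\mathbf{B}$ are
given by:

$$\mathbf{A} = \begin{bmatrix}
\mathbf{h}_0 & & \\
\mathbf{h}_1 & \ddots & \\
\vdots & \ddots & \mathbf{h}_0 \\
\vdots & & \vdots \\
\mathbf{h}_{L-1} & \cdots & \mathbf{h}_{L-\delta}
\end{bmatrix}, \qquad
\mathbf{B} = \begin{bmatrix}
\mathbf{h}_L & \mathbf{h}_{L-1} & \cdots & \mathbf{h}_{L-\delta+1} \\
 & \mathbf{h}_L & \ddots & \vdots \\
 & & \ddots & \mathbf{h}_{L-1} \\
 & & & \mathbf{h}_L
\end{bmatrix}$$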
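To make the partition concrete, here is a minimal Python sketch that builds the shifted-channel matrix, splits it into its first ML rows (A) and last M(L' - L) rows (B), and checks that the $\ell_1$ objective separates accordingly. It assumes the taps are indexed h_0, ..., h_L as in Section II; the helper name `shifted_channel_matrix` and the sizes are hypothetical.

```python
import random

def shifted_channel_matrix(h, L_over):
    """ML' x (L'-L) matrix whose j-th column is the channel delayed by j taps.

    Block row r (r = 1..L') and column j hold tap h_{r-j}, or zero if that
    tap index falls outside 0..L (assumed indexing h_0, ..., h_L).
    """
    L, M = len(h) - 1, len(h[0])
    delta = L_over - L
    rows = []
    for r in range(1, L_over + 1):
        for m in range(M):
            row = []
            for j in range(1, delta + 1):
                tap = r - j
                row.append(h[tap][m] if 0 <= tap <= L else 0.0)
            rows.append(row)
    return rows

def matvec(mat, vec):
    return [sum(a * x for a, x in zip(row, vec)) for row in mat]

rng = random.Random(1)
M, L, L_over = 2, 3, 5                      # hypothetical sizes, L > L' - L >= 1
delta = L_over - L
h = [[rng.gauss(0, 1) for _ in range(M)] for _ in range(L + 1)]

H_tilde = shifted_channel_matrix(h, L_over)
A, B = H_tilde[:M * L], H_tilde[M * L:]
h_vec = [x for tap in h[1:] for x in tap]   # [h_1; ...; h_L]
h_z = h_vec + [0.0] * (M * delta)           # zero-padded vector

g = [rng.gauss(0, 1) for _ in range(delta)]
lhs = sum(abs(a + b) for a, b in zip(h_z, matvec(H_tilde, g)))
rhs = (sum(abs(a + b) for a, b in zip(h_vec, matvec(A, g)))
       + sum(abs(b) for b in matvec(B, g)))
print(len(A), len(B), abs(lhs - rhs))
```

In this construction B is block upper triangular with the last tap on its diagonal, which is why B g vanishes only for g = 0 when that tap is nonzero.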



A. Necessary and sufficient condition for the $\ell_1$ norm

Unlike the $\ell_p$ quasi-norm (p < 1), the $\ell_1$ norm is
convex. It is thus possible to derive a necessary and sufficient
condition for channel identifiability.

Our theorem is stated as follows:

Theorem 1. Necessary and sufficient condition

Let $\mathbf{v} = [\mathrm{sign}(\mathbf{h}_1)^T, \cdots, \mathrm{sign}(\mathbf{h}_L)^T]^T$
and assume that $L > L' - L \geq 1$. Then the necessary and
sufficient condition for channel identifiability can be expressed
as:

$$\frac{|\mathbf{v}^T \mathbf{A}\mathbf{g}|}{\|\mathbf{B}\mathbf{g}\|_1} < 1 \quad \forall\, \mathbf{g} \in \mathbb{R}^{L'-L},\ \mathbf{g} \neq \mathbf{0} \qquad (3)$$

where $\|\mathbf{x}\|_p^p = \sum_i |x_i|^p$.

Proof: See Appendix A.
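For an overestimation by a single tap (L' - L = 1), g is a scalar and condition (3) reduces to comparing |v^T A| with the $\ell_1$ norm of the last tap. The sketch below, under the assumption that the taps are indexed h_0, ..., h_L and drawn as Gaussian samples purely for illustration, compares this test with a brute-force check that g = 0 is the unique minimizer of the one-dimensional piecewise-linear $\ell_1$ objective (whose minimum is always attained at a breakpoint). All function names are hypothetical.

```python
import random

def l1_condition_delta1(h):
    """Condition (3) when L' - L = 1: |v^T A| < ||h_L||_1."""
    L = len(h) - 1
    vTA = sum((1.0 if x > 0 else -1.0) * y
              for i in range(1, L + 1)
              for x, y in zip(h[i], h[i - 1]))
    return abs(vTA) < sum(abs(x) for x in h[L])

def l1_objective(h, g):
    """||h + A g||_1 + ||B g||_1 for scalar g (A stacks h_0..h_{L-1}, B = h_L)."""
    L = len(h) - 1
    head = sum(abs(x + g * y)
               for i in range(1, L + 1)
               for x, y in zip(h[i], h[i - 1]))
    return head + abs(g) * sum(abs(x) for x in h[L])

def zero_is_unique_minimizer(h):
    """Compare f(0) with f at every other breakpoint of the convex objective."""
    L = len(h) - 1
    breakpoints = [-x / y
                   for i in range(1, L + 1)
                   for x, y in zip(h[i], h[i - 1])]
    f0 = l1_objective(h, 0.0)
    return all(l1_objective(h, b) > f0 for b in breakpoints)

rng = random.Random(2)
M, L, trials = 4, 3, 300                    # hypothetical sizes
agree = 0
for _ in range(trials):
    h = [[rng.gauss(0, 1) for _ in range(M)] for _ in range(L + 1)]
    agree += l1_condition_delta1(h) == zero_is_unique_minimizer(h)
print(agree, "of", trials, "channels agree")
```

Agreement on every draw is what Theorem 1 predicts for this special case; it is a numerical illustration, not a substitute for the proof in Appendix A.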
B. Sufficient condition for the $\ell_p$ quasi-norm

Since the $\ell_p$ quasi-norm is a non-convex function, problem
$(P_p)$ might have many local minima. Nevertheless, we can still
find a sufficient condition that ensures that the channel can be
identified as a local minimum of problem $(P_p)$. The result is
stated in the following theorem:

Theorem 2. Let

$$\mathbf{v} = p \begin{bmatrix} \mathrm{sign}(\mathbf{h}_1) \\ \vdots \\ \mathrm{sign}(\mathbf{h}_L) \end{bmatrix} \odot |\mathbf{h}|^{p-1}$$

where $\odot$ denotes the Hadamard (element-by-element) product
and $|\mathbf{h}|^{p-1}$ is taken entrywise. If the following
condition is satisfied:

$$\frac{|\mathbf{v}^T \mathbf{A}\mathbf{g}|}{\|\mathbf{B}\mathbf{g}\|_1} < 1 \quad \forall\, \mathbf{g} \in \mathbb{R}^{L'-L},\ \mathbf{g} \neq \mathbf{0} \qquad (4)$$

then the channel can be identified as a local minimum of $(P_p)$.

Proof: See Appendix B.
Remark: Note that for each channel realization, there exists p
such that the channel identifiability condition (4) holds.
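The remark can be illustrated numerically: the entries of v scale like p |h_i|^{p-1}, which shrink as p decreases for nonzero h_i, so the ratio in (4) eventually drops below 1. The sketch below evaluates that ratio for L' - L = 1 on a small hand-picked channel; the tap values, the indexing h_0, ..., h_L, and the function name `lp_ratio_delta1` are assumptions made for illustration only.

```python
def lp_ratio_delta1(h, p):
    """|v^T A| / ||h_L||_1 with v = p * sign(h) * |h|^(p-1), for L' - L = 1."""
    L = len(h) - 1
    vTA = sum(p * (1.0 if x > 0 else -1.0) * abs(x) ** (p - 1.0) * y
              for i in range(1, L + 1)
              for x, y in zip(h[i], h[i - 1]))
    return abs(vTA) / sum(abs(x) for x in h[L])

# Hypothetical taps h_0, h_1, h_2 with M = 2 antennas.
h = [[0.9, -0.4], [0.5, 0.8], [-0.3, 0.2]]
ratios = {p: lp_ratio_delta1(h, p) for p in (1.0, 0.5, 0.1, 0.01)}
for p, r in ratios.items():
    print(p, round(r, 3), "condition holds" if r < 1.0 else "condition fails")
```

For this particular channel the $\ell_1$ condition (p = 1) fails while the condition for a sufficiently small p holds, which is the behaviour the remark describes.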

IV. Probabilistic Analysis

The necessary and sufficient conditions being provided, a natural
question is how often they hold. Obviously, this depends on the
distribution of the channel vector elements. In what follows, we
assume that the channel vector elements are drawn from a zero-mean
Gaussian distribution:

A.4 The entries of the channel vectors
$\mathbf{h}_0, \cdots, \mathbf{h}_L$ are real Gaussian with zero
mean and variance $\frac{1}{L+1}$.

To determine a lower bound on the channel identifiability
probability, we rely on the techniques derived in [6], [7].
Actually, in the same way as in [6], we notice that conditions
(3) and (4) possess an equivalent characterization, as shown in
the following lemma, whose proof can be found in [6]:

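Lemma 1. Let $\mathbf{B}$ be an $n \times N$ matrix with rank N.
For any vector $\mathbf{z}$ define:

$$\|\mathbf{z}\|_{\mathbf{B}} := \sup_{\mathbf{x} \neq \mathbf{0}} \frac{|\mathbf{z}^T \mathbf{x}|}{\|\mathbf{B}\mathbf{x}\|_1}$$

We then have the equivalent characterization:

$$\|\mathbf{z}\|_{\mathbf{B}} = \min \|\mathbf{d}\|_\infty \quad \text{under the constraint} \quad \mathbf{B}^T \mathbf{d} = \mathbf{z}$$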



Applying Lemma 1, conditions (3) and (4) are equivalent to
stating that the following problem:

$$\min \|\mathbf{d}\|_\infty \quad \text{under the constraint} \quad \mathbf{B}^T \mathbf{d} = \mathbf{A}^T \mathbf{v}$$

achieves a minimum value which is less than 1. In other words,
conditions (3) and (4) are satisfied if and only if there exists
a vector $\mathbf{d}$ with $\|\mathbf{d}\|_\infty < 1$ such that
$\mathbf{B}^T \mathbf{d} = \mathbf{A}^T \mathbf{v}$.
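When B has a single column b (the case L' - L = 1), both sides of Lemma 1 have a closed form: the supremum equals |z| / ||b||_1, and it is attained in the dual problem by d = z sign(b) / ||b||_1, since Hölder's inequality gives |z| = |b^T d| <= ||b||_1 ||d||_inf for any feasible d. The following sketch checks this on hypothetical numbers; the function name and the values of b and z are illustrative assumptions.

```python
def dual_norm_single_column(b, z):
    """Return (|z| / ||b||_1, d) where d = z * sign(b) / ||b||_1 solves b^T d = z."""
    norm1 = sum(abs(x) for x in b)
    d = [z * (1.0 if x > 0 else -1.0) / norm1 for x in b]
    return abs(z) / norm1, d

b = [-0.3, 0.2]     # hypothetical single column of B (last tap, M = 2)
z = 0.8             # hypothetical scalar value of A^T v
value, d = dual_norm_single_column(b, z)
constraint = sum(x * y for x, y in zip(b, d))   # should reproduce z
d_inf = max(abs(x) for x in d)
print(value, d_inf, constraint)
```

Here the minimum exceeds 1, so no vector d inside the open unit cube satisfies the constraint and the identifiability condition fails for these values.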

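Since $\mathrm{rank}(\mathbf{B}) = \delta$ almost surely, where
$\delta = L' - L$, channel identifiability holds if the following
conditions are satisfied for some $\alpha > 0$:

• The image of the cube $\{\mathbf{d} : \|\mathbf{d}\|_\infty < 1\}$
  by matrix $\mathbf{B}^T$ contains a ball of radius $\alpha$,

• The vector $\mathbf{A}^T \mathbf{v}$ satisfies
  $\|\mathbf{A}^T \mathbf{v}\|_2 < \alpha$.

Let $\mathcal{P}$ denote the probability that channel
identifiability holds, and let $E^1_\alpha$ and $E^2_\alpha$ be
the events given by:

$$E^1_\alpha = \{\text{The image of the cube by } \mathbf{B}^T \text{ contains a ball of radius } \alpha\}$$

$$E^2_\alpha = \{\|\mathbf{A}^T \mathbf{v}\|_2 < \alpha\}$$

Then, $\mathcal{P}$ can be lower-bounded as:

$$\mathcal{P} \geq P\left[\bigcup_{\alpha} \left(E^1_\alpha \cap E^2_\alpha\right)\right] \geq \max_{\alpha} P\left(E^1_\alpha \cap E^2_\alpha\right).$$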

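Next, we consider the computation of the lower bound for the
$\ell_1$ norm when $\delta = 1$. More explicitly, we show the
following theorem:

Theorem 3. For $\delta = 1$, the probability $\mathcal{P}$ that
the necessary and sufficient condition (3) holds satisfies:

$$\mathcal{P} \geq \max_{\epsilon \in [0,1]} \left(1 - \exp\left(-\frac{M\epsilon^2}{\pi}\right)\right) \frac{1}{\sqrt{\pi}}\, \gamma\left(\frac{1}{2}, \frac{M(1-\epsilon)^2}{\pi L}\right) \qquad (5)$$

where $\gamma(\cdot,\cdot)$ denotes the lower incomplete gamma
function.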
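The right-hand side of (5) is easy to evaluate numerically. The sketch below uses the identity $\frac{1}{\sqrt{\pi}}\gamma(\frac{1}{2}, x) = \mathrm{erf}(\sqrt{x})$ so that only the Python standard library is needed, and maximizes over a uniform grid of values of epsilon. The function name `lower_bound`, the grid size, and the parameter values are illustrative assumptions; a finer optimizer could be substituted.

```python
import math

def lower_bound(M, L, grid=2001):
    """Grid-search evaluation of the right-hand side of (5).

    Uses gamma(1/2, x) / sqrt(pi) = erf(sqrt(x)) for the incomplete gamma term.
    """
    best = 0.0
    for i in range(grid):
        eps = i / (grid - 1)
        p1 = 1.0 - math.exp(-M * eps * eps / math.pi)
        p2 = math.erf(math.sqrt(M * (1.0 - eps) ** 2 / (math.pi * L)))
        best = max(best, p1 * p2)
    return best

for M, L in [(8, 4), (64, 4), (16, 2), (16, 16)]:   # hypothetical settings
    print(M, L, round(lower_bound(M, L), 4))
```

Since both factors increase with M for every fixed epsilon, and the second factor decreases with L, the computed bound grows with the number of antennas and shrinks with the channel order.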
ee[o,i] \ 7C J \J% 2 nL 

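Proof: When $\delta = 1$, it is easy to note that the events
$E^1_\alpha$ and $E^2_\alpha$ are independent. Therefore,

$$P(E^1_\alpha \cap E^2_\alpha) = P(E^1_\alpha)\,P(E^2_\alpha)$$

On the other hand, $\mathbf{v}^T \mathbf{A}$ is a scalar Gaussian
random variable with mean 0 and variance $\frac{LM}{L+1}$.
Therefore,

$$P\left[|\mathbf{v}^T \mathbf{A}| < \alpha\right] = P\left[|\mathbf{v}^T \mathbf{A}|^2 \frac{L+1}{LM} < \frac{\alpha^2 (L+1)}{LM}\right] = \frac{1}{\sqrt{\pi}}\, \gamma\left(\frac{1}{2}, \frac{\alpha^2 (L+1)}{2LM}\right)$$

Moreover, for $\delta = 1$ we have $\mathbf{B} = \mathbf{h}_L$,
so the image of the cube by $\mathbf{B}^T$ is the interval
$(-\|\mathbf{h}_L\|_1, \|\mathbf{h}_L\|_1)$ and $E^1_\alpha$
occurs when $\|\mathbf{h}_L\|_1 \geq \alpha$. Using standard
concentration inequalities (see Theorem 2 in [8]), we have:

$$P\left[\sqrt{L+1}\,\|\mathbf{h}_L\|_1 < (1-\epsilon)\sqrt{\frac{2}{\pi}}\,M\right] \leq \exp\left(-\frac{\epsilon^2 M}{\pi}\right)$$

As a consequence,

$$P\left[\|\mathbf{h}_L\|_1 < (1-\epsilon)\sqrt{\frac{2}{\pi(L+1)}}\,M\right] \leq \exp\left(-\frac{\epsilon^2 M}{\pi}\right)$$

This gives the following probability lower bound:

$$P\left[\|\mathbf{h}_L\|_1 \geq (1-\epsilon)\sqrt{\frac{2}{\pi(L+1)}}\,M\right] \geq 1 - \exp\left(-\frac{M\epsilon^2}{\pi}\right)$$

Taking $\alpha = (1-\epsilon)\sqrt{\frac{2}{\pi(L+1)}}\,M$, we get

$$\mathcal{P} \geq \max_{\epsilon \in [0,1]} \left(1 - \exp\left(-\frac{M\epsilon^2}{\pi}\right)\right) \frac{1}{\sqrt{\pi}}\, \gamma\left(\frac{1}{2}, \frac{M(1-\epsilon)^2}{\pi L}\right)$$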



Interpretation: The result above shows that, as the number of
antennas M increases, the probability that the channel
identifiability condition holds tends to 1. On the other hand,
increasing the channel order L tends to have the opposite effect:
it makes the condition more likely to fail.
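The trend described above can also be observed empirically. The following Monte Carlo sketch draws random channels and counts how often the $\ell_1$ condition holds for L' - L = 1, where it reduces to |v^T A| < ||h_L||_1. It assumes taps indexed h_0, ..., h_L with i.i.d. zero-mean Gaussian entries of variance 1/(L+1); the function name, the trial count, and the seed are illustrative choices.

```python
import random

def identifiability_frequency(M, L, trials=2000, seed=0):
    """Empirical frequency of |v^T A| < ||h_L||_1 over random Gaussian channels."""
    rng = random.Random(seed)
    sigma = (1.0 / (L + 1)) ** 0.5          # assumed per-entry standard deviation
    hits = 0
    for _ in range(trials):
        h = [[rng.gauss(0.0, sigma) for _ in range(M)] for _ in range(L + 1)]
        vTA = sum((1.0 if x > 0 else -1.0) * y
                  for i in range(1, L + 1)
                  for x, y in zip(h[i], h[i - 1]))
        if abs(vTA) < sum(abs(x) for x in h[L]):
            hits += 1
    return hits / trials

for M, L in [(2, 16), (8, 4), (32, 2)]:     # hypothetical settings
    print(M, L, identifiability_frequency(M, L))
```

Because (5) is only a lower bound, the empirical frequencies are expected to lie above it, while following the same dependence on M and L.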
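V. Simulation Results

We present here simulation results for the $\ell_1$ norm. Fig. 1
displays the effect of the system parameters L and M on the lower
bound probability, which we have computed by numerically
maximizing (5). Fig. 1 shows that increasing the number of
antennas tends to enhance channel identifiability, a result which
has been previously mentioned.

Fig. 1. Impact of the system parameters L and M on the lower bound
probability

Appendix A
Proof of Theorem 1

The proof of Theorem 1 relies on the following mathematical
results about the optimization of convex functions, which we
provide for the sake of completeness.

Definition 1. Let $f : \mathbb{R}^p \to \mathbb{R}$ be a
real-valued function. The directional derivative of f at
$\mathbf{x}_0$ in the direction $\mathbf{y}$ is given by:

$$f'(\mathbf{x}_0, \mathbf{y}) = \lim_{t \to 0,\ t > 0} \frac{f(\mathbf{x}_0 + t\mathbf{y}) - f(\mathbf{x}_0)}{t}$$

Theorem 4. Let f be a convex function defined on a convex set X
and $\mathbf{x}_0 \in X$ be a point where f is finite. Then
$\mathbf{x}_0$ is a global minimum point of f if and only if the
following condition holds:

$$f'(\mathbf{x}_0, \mathbf{x} - \mathbf{x}_0) \geq 0 \quad \forall\, \mathbf{x} \in X$$

We are now in a position to establish Theorem 1.
Let f be the convex function defined as:

$$f(\mathbf{g}) = \|\mathbf{h}_z + \tilde{\mathbf{H}}\mathbf{g}\|_1.$$

Then, $f(\mathbf{g})$ can also be written as:

$$f(\mathbf{g}) = \|\mathbf{h} + \mathbf{A}\mathbf{g}\|_1 + \|\mathbf{B}\mathbf{g}\|_1$$

where $\mathbf{h} = [\mathbf{h}_1^T, \cdots, \mathbf{h}_L^T]^T$.

It is clear that the channel identifiability condition holds if
$\mathbf{g} = \mathbf{0}$ is the unique global minimum of f. By
Theorem 4, and since f is piecewise linear, this is the case if
and only if

$$f'(\mathbf{0}, \mathbf{y}) > 0 \quad \forall\, \mathbf{y} \in \mathbb{R}^{L'-L},\ \mathbf{y} \neq \mathbf{0} \qquad (6)$$

Since

$$f'(\mathbf{0}, \mathbf{g}) = \lim_{t \to 0,\ t > 0} \frac{\|\mathbf{h} + t\mathbf{A}\mathbf{g}\|_1 - \|\mathbf{h}\|_1 + t\|\mathbf{B}\mathbf{g}\|_1}{t} = \mathbf{v}^T \mathbf{A}\mathbf{g} + \|\mathbf{B}\mathbf{g}\|_1,$$

equation (6), applied to both $\mathbf{g}$ and $-\mathbf{g}$,
implies that:

$$\frac{|\mathbf{v}^T \mathbf{A}\mathbf{g}|}{\|\mathbf{B}\mathbf{g}\|_1} < 1.$$

Appendix B
Proof of Theorem 2

To prove Theorem 2, it suffices to establish that when condition
(4) is satisfied, vector $\mathbf{g} = \mathbf{0}$ is a local
minimum of the function

$$f_p(\mathbf{g}) = \|\mathbf{h}_z + \tilde{\mathbf{H}}\mathbf{g}\|_p^p = \|\mathbf{h} + \mathbf{A}\mathbf{g}\|_p^p + \|\mathbf{B}\mathbf{g}\|_p^p$$

where $\mathbf{h} = [\mathbf{h}_1^T, \cdots, \mathbf{h}_L^T]^T$.
In other words, we need to show that there exists a neighborhood
of $\mathbf{0}$ in which all vectors $\mathbf{g}$ satisfy
$\|\mathbf{h} + \mathbf{A}\mathbf{g}\|_p^p + \|\mathbf{B}\mathbf{g}\|_p^p \geq \|\mathbf{h}\|_p^p$.

Let $h_i$ denote the i-th entry of $\mathbf{h}$ and
$\mathbf{e}_i$ be the vector with 1 in the i-th position and 0
elsewhere. Let $\rho_i = \mathrm{sign}(h_i)$. Then, it is easy to
show that: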



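$$|h_i + \mathbf{e}_i^T \mathbf{A}\mathbf{g}|^p = \left|\,|h_i| + \rho_i \mathbf{e}_i^T \mathbf{A}\mathbf{g}\right|^p$$

Thereby, the Taylor expansion of
$|h_i + \mathbf{e}_i^T \mathbf{A}\mathbf{g}|^p$ in a neighborhood
of $\mathbf{0}$ is given by:

$$|h_i + \mathbf{e}_i^T \mathbf{A}\mathbf{g}|^p = |h_i|^p + p\rho_i |h_i|^{p-1} \mathbf{e}_i^T \mathbf{A}\mathbf{g} + \frac{p(p-1)}{2} |h_i|^{p-2} |\mathbf{e}_i^T \mathbf{A}\mathbf{g}|^2 + o(\|\mathbf{g}\|^2)$$

Hence,

$$\frac{|h_i + \mathbf{e}_i^T \mathbf{A}\mathbf{g}|^p - |h_i|^p - p\rho_i |h_i|^{p-1} \mathbf{e}_i^T \mathbf{A}\mathbf{g}}{p\rho_i |h_i|^{p-1} \mathbf{e}_i^T \mathbf{A}\mathbf{g}} \xrightarrow[\mathbf{g} \to \mathbf{0}]{} 0.$$

As a consequence, for all $\epsilon > 0$, there exists a
neighborhood $V_{\epsilon,i}$ of $\mathbf{0}$ such that:

$$|h_i + \mathbf{e}_i^T \mathbf{A}\mathbf{g}|^p \geq |h_i|^p + p\rho_i |h_i|^{p-1} \mathbf{e}_i^T \mathbf{A}\mathbf{g} - \epsilon p |h_i|^{p-1} |\mathbf{e}_i^T \mathbf{A}\mathbf{g}|$$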



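We can assume without loss of generality that:

$$\sum_{i=1}^{LM} \rho_i |h_i|^{p-1} \mathbf{e}_i^T \mathbf{A}\mathbf{g} \leq 0,$$

because, otherwise, one can easily prove, for $\epsilon$ small
enough, the existence of a neighborhood in which the following
desired inequality holds:

$$\|\mathbf{h} + \mathbf{A}\mathbf{g}\|_p^p + \|\mathbf{B}\mathbf{g}\|_p^p \geq \|\mathbf{h}\|_p^p.$$

Since
$\sum_{i=1}^{LM} p\rho_i |h_i|^{p-1} \mathbf{e}_i^T \mathbf{A}\mathbf{g} = -\left|\sum_{i=1}^{LM} p\rho_i |h_i|^{p-1} \mathbf{e}_i^T \mathbf{A}\mathbf{g}\right|$,
for all $\mathbf{g} \in V_\epsilon = \bigcap_{i=1}^{LM} V_{\epsilon,i}$,
we get

$$\sum_{i=1}^{LM} |h_i + \mathbf{e}_i^T \mathbf{A}\mathbf{g}|^p \geq \sum_{i=1}^{LM} |h_i|^p - \left|\sum_{i=1}^{LM} p\rho_i |h_i|^{p-1} \mathbf{e}_i^T \mathbf{A}\mathbf{g}\right| - \epsilon p \sum_{i=1}^{LM} |h_i|^{p-1} |\mathbf{e}_i^T \mathbf{A}\mathbf{g}| \qquad (7)$$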



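On the other hand, choose $\epsilon$ such that

$$\|\mathbf{B}\mathbf{g}\|_1 > \epsilon p \sum_{i=1}^{LM} |h_i|^{p-1} |\mathbf{e}_i^T \mathbf{A}\mathbf{g}|.$$

This is possible since, with probability equal to one,
$\|\mathbf{B}\mathbf{g}\|_1 = 0$ if and only if
$\mathbf{g} = \mathbf{0}$. Then, as p < 1, one can easily check
that

$$\frac{\|\mathbf{B}\mathbf{g}\|_p^p}{\|\mathbf{B}\mathbf{g}\|_1 - \epsilon p \sum_{i=1}^{LM} |h_i|^{p-1} |\mathbf{e}_i^T \mathbf{A}\mathbf{g}|} \xrightarrow[\mathbf{g} \to \mathbf{0}]{} +\infty.$$

Hence, there exists a neighborhood $V'_\epsilon$ of $\mathbf{0}$
such that for all $\mathbf{g} \in V'_\epsilon$, we have:

$$\|\mathbf{B}\mathbf{g}\|_p^p \geq \|\mathbf{B}\mathbf{g}\|_1 - \epsilon p \sum_{i=1}^{LM} |h_i|^{p-1} |\mathbf{e}_i^T \mathbf{A}\mathbf{g}| \qquad (8)$$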

i=l 

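On the other hand, the channel identifiability condition (4)
implies that

$$\|\mathbf{B}\mathbf{g}\|_1 > \left|\sum_{i=1}^{LM} p\rho_i |h_i|^{p-1} \mathbf{e}_i^T \mathbf{A}\mathbf{g}\right| \qquad (9)$$

Plugging (9) into (8), we get:

$$\|\mathbf{B}\mathbf{g}\|_p^p \geq \left|\sum_{i=1}^{LM} p\rho_i |h_i|^{p-1} \mathbf{e}_i^T \mathbf{A}\mathbf{g}\right| - \epsilon p \sum_{i=1}^{LM} |h_i|^{p-1} |\mathbf{e}_i^T \mathbf{A}\mathbf{g}|$$

This proves, by using (7), that for all
$\mathbf{g} \in V_\epsilon \cap V'_\epsilon$, we have:

$$\|\mathbf{h} + \mathbf{A}\mathbf{g}\|_p^p + \|\mathbf{B}\mathbf{g}\|_p^p \geq \|\mathbf{h}\|_p^p,$$

thereby proving the result.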